Parallel Monte-Carlo tree search for HPC systems and its application to computer go

نویسنده

  • Lars Schäfers
چکیده

Monte-Carlo Tree Search (MCTS) is a class of simulation-based search algorithms. It brought about great success in the past few years regarding the evaluation of deterministic two-player games such as the Asian board game Go. A breakthrough was achieved in 2006, when Rémi Coulom placed 1st at the ICGA Computer Go Olympiad in Turin with his MCTS based Go programm CrazyStone in the 9× 9 devision. Until today, MCTS highly dominates over traditional methods such as αβ search in the field of Computer Go. Over the years, MCTS found applications in several search domains. A recent survey of MCTS methods lists almost 250 MCTS related publications originating only from the last seven years, which demonstrates the popularity and importance of MCTS. It is currently emerging as a powerful tree search algorithm yielding promising results in many search domains such as connection games Hex and Havannah, combinatorial games Breakthrough and Amazons as well as General Game Playing and real-time games. Apart from games, MCTS finds applications in combinatorial optimization, constraint satisfaction, scheduling problems, sample-based planning and procedural content generation. In this thesis, we present a parallelization of the most popular MCTS variant for large HPC compute clusters that efficiently shares a single game tree representation in a distributed memory environment and scales up to 128 compute nodes and 2048 cores. It is hereby one of the most powerful MCTS parallelizations to date. We empirically confirmed its performance with extensive experiments and showed our parallelization’s power in numerous competitions with solutions of other research teams around the world. In order to measure the impact of our parallelization on the search quality and remain comparable to the most advanced MCTS implementations to date, we implemented it in a state-of-the-art Go engine Gomorra, making it competitive with the strongest Go programs in the world.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Monte-Carlo Tree Search for HPC Systems

Monte-Carlo Tree Search (MCTS) is a simulation-based search method that brought about great success to applications such as ComputerGo in the past few years. The power of MCTS strongly depends on the number of simulations computed per time unit and the amount of memory available to store data gathered during simulation. High-performance computing systems such as large compute clusters provide v...

متن کامل

Exploration exploitation in Go: UCT for Monte-Carlo Go

Algorithm UCB1 for multi-armed bandit problem has already been extended to Algorithm UCT which works for minimax tree search. We have developed a Monte-Carlo program, MoGo, which is the first computer Go program using UCT. We explain our modifications of UCT for Go application, among which efficient memory management, parametrization, ordering of non-visited nodes and parallelization. MoGo is n...

متن کامل

Monte Carlo Tree Search in Go

Monte Carlo Tree Search (MCTS) was born in Computer Go, i.e. in the application of artificial intelligence to the game of Go. Since its creation, in 2006, many improvements have been published. Programs are still by far weaker than the best human players, yet the gap was very significantly reduced. MCTS is now widely applied in games, in particular when no trustable evaluation function is readi...

متن کامل

Application of MCTS in Tsumego of Computer Go

The Tsumego problem in Go was a basic and essential problem to be overcome in implementing a computer Go program. This paper proposed a reality of Monte-Carlo tree search in Tsumego of computer Go which using Monte-Carlo evaluation as an alternative for a positional evaluation function. The advantage of this technique was that it requires few domain knowledge or expert input.

متن کامل

A Parallel Monte-Carlo Tree Search Algorithm

Monte-Carlo tree search is a powerful paradigm for the game of Go. We present a parallel Master-Slave algorithm for Monte-Carlo tree search. We experimented the algorithm on a network of computers using various configurations: from 12,500 to 100,000 playouts, from 1 to 64 slaves, and from 1 to 16 computers. On our architecture we obtain a speedup of 14 for 16 slaves. With a single slave and fiv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014